What to do about bad language on the internet

نویسنده

  • Jacob Eisenstein
چکیده

The rise of social media has brought computational linguistics in ever-closer contact with bad language: text that defies our expectations about vocabulary, spelling, and syntax. This paper surveys the landscape of bad language, and offers a critical review of the NLP community’s response, which has largely followed two paths: normalization and domain adaptation. Each approach is evaluated in the context of theoretical and empirical work on computer-mediated communication. In addition, the paper presents a quantitative analysis of the lexical diversity of social media text, and its relationship to other corpora.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Why we need to read and understand literature: literariness and Hans Rosling’s Factfulness (2018)

My article addresses the qualities of “good” literature and how an understanding of the nature of literary devices, so-called “literariness”, can enhance the reading experience. Focusing on Hans Rosling’s Factfulness (2018), I discuss some of the most important features of good writing. Six literary devices have been selected for special attention: point of view, tone, amplification, anecdotes,...

متن کامل

Computer security in the future

Until recently, computer security was an obscure discipline that seemed to have little relevance to everyday life. With the rapid growth of the Internet, e-commerce, and the widespread use of computers, computer security touches almost all aspects of daily life and all parts of society. Even those who do not use computers have information about them stored on computers. This paper reviews some ...

متن کامل

Survey on the Status of Persian-Language Health Services through the Internet

Abstract Background: The Internet has been able to convert the manner of information seeking and has changed the users’ approach to information particularly in health domain. In this regard, the number of Persian-language websites in health service are increasing. Therefore, information about the variety of services offered by them is very important. The present study was designed to describe ...

متن کامل

What Do Iranian EFL Learners and Teachers Think of Teaching Impoliteness?

Every language involves friendly and polite as well as hostile and impolite situationsin which language users have to use the context-appropriate language. However,unlike politeness which has generated a great number of studies, few studies havebeen conducted on impoliteness especially in EFL contexts. The present study aimedto see whether language learners and teachers hold the same idea conce...

متن کامل

Validating an English Language Teacher Professional Development Scale in Iranian EFL Context

Although decades of research have well elaborated on teacher professional development, we still do not have a thorough picture about what teacher professional development could entail and what components it consists of. The present study aims to develop and validate a teacher professional development scale in an Iranian English foreign language context. An initial tentative model with 130 items...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013